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Abstract 

We describe the structure of connected graphs with the minimum 
and maximum average distance, radius, diameter, betweenness cen- 
traUty, efficiency and resistance distance, given their order and size. 
We find tight bounds on these graph quahties for any arbitrary num- 
ber of nodes and edges and analytically derive the form and properties 
of such networks. 



1 Introduction 



Many complex systems can be described as interconnections of simpler ele- 
ments, which in turn can be analyzed abstractly as graphs. We are interested 
in the structural properties of these networks, regardless of the nature of their 
individual parts. This allows the results developed in this paper to be appli- 
cable in a wide range of different disciplines, such as neuroscience, biology, 
social sciences and engineering. The properties that are of general interest 
are the average distance, betweenness centrality, radius, diameter, efficiency 
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and the graph resistance. Depending on the apphcation we usually want to 
minimize or maximize one or more of the above, because they are directly 
implicated in some performance metric of the network. They are correlated 
with how fast the system responds to different input stimuli [1] and how ro- 
bust it is to the failure of individual subsystems, due to random failures or 
targeted attacks [2], [3], [4]. They also indicate how efficient message propaga- 
tion is across a network [5], how easy it is for dynamical processes that require 
global coordination and information flow (like synchronization or computa- 
tion) to take place, and how reliable a transmitted message is in the presence 
of noise [I]. Although these structural properties do not take into account 
the specifics of the various systems, focusing on the structural patterns of 
the network architecture can give a valuable insight on how to optimize the 
network function, while obeying other constraints. 

2 Preliminaries 

This section provides a brief introduction to the notions from graph theory 
that are used throughout this study. A graph (also called a network) is an 
ordered pair Q = (y,£) comprised of a set V = V{Q) of vertices together 
with a set £^ = £^(^) of edges that are unordered 2-element subsets of V. 
Two vertices u and v are called neighbors if they are connected through an 
edge ((u, v) G £) and we write u — v, otherwise we write u-/-v. All graphs in 
this article are simple, meaning that all edges connect two different vertices, 
there is at most one edge connecting any pair of vertices, and edges have no 
direction. The neighborhood Mu of a vertex u is the set of its neighbors. The 
degree of a vertex is the number of its neighbors. A vertex is said to have 
full degree if it is connected to every other vertex in the network. A network 
is assortative with respect to its degree distribution when the vertices with 
large degrees are connected to others that have large degrees. When vertices 
with small degrees connect to vertices with large degrees and vice versa, 
then the network is called disassortative. The order iV of a graph is the 
number of its vertices, N = \V\. A graph's size (denoted by m = \S\), is 
the number of its edges. We will denote a graph Q of order N and size 
m as Q{N,m) or simply QN,m- A complete graph is a graph in which each 
vertex is connected to every other. The edge density of a graph is defined as 
p = m/ (^) , representing the number of present edg fraction of largest 

possible number of edges, which is the size of a complete graph. A clique in 
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a graph is a subset of its vertices such that every vertex pair in the subset 
is connected. The clique order is the number of vertices that belong to it. 
A path is a sequence of consecutive edges in a graph and the length of the 
path is the number of edges traversed. The distance between two vertices 
u and V, usually denoted by d — d{u,v), is the length of the shortest path 
that connects these two vertices. A cycle is a closed (simple) path, with no 
other repeated vertices or edges other than the starting and ending nodes. 
A full cycle is a cycle that includes all the vertices of the network. A graph 
is connected if for every pair of vertices u and v, there is a path from u to v. 
Otherwise the graph is called disconnected. We will be focusing exclusively 
on connected graphs, given that every disconnected graph can be analyzed 
as the sum of its connected components. If the distance between u and v 
is equal to k, then these vertices are called k— neighbors, and the set of all 
pairs in the graph that are /c— neighbors is denoted by Sk- The eccentricity 
of a vertex u is the maximum distance of u from any other vertex in the 
graph. A central vertex of a graph is a vertex that has eccentricity smaller or 
equal to any other node. A network may have many central vertices, all of 
which are considered its centers. The eccentricity of a central vertex is called 
the graph radius. The graph diameter is defined as the maximum of the 
distances among all vertex pairs in the network. A tree is a graph in which 
any two vertices are connected by exactly one path. A cut is a partition of 
the vertices of a graph into two disjoint subsets. A cut set of the cut is the set 
of edges whose end points are in different subsets of the cut. A cut vertex of 
a connected graph is a vertex that if removed, (along with all edges incident 
with it) produces a graph that is disconnected. An edge is rewired when we 
change the vertices it is adjacent to. A single rewiring takes place when we 
change one of the vertices that is adjacent to it, and a double rewiring when 
we change both of them. A subgraph "H of a graph Q is called induced if 
V{'H) C V{Q) and for any pair of vertices u and v in V{'H), {u,v) G i^('H) 
if and only if {u,v) e S{Q). In other words, T-L is an induced subgraph of Q 
if it has the same edges that appear in Q over the same vertex set. Finally, 
two graphs Q and are called isomorphic if there exists a bijective function 
/ : V{g) V{n) such that 

(u, v) e EiG) ^ ifiu), fiv)) e Sin). (1) 

Two graphs that are isomorphic have by definition the same order and size, 
and are considered identical. 
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3 Networks with the Minimum and Maxi- 



mum Average Shortest Path Length 
3.1 Minimum Average Distance 

The average distance of a network is an important property, since it is a direct 
indicator of how different parts of the network communicate, and exchange 
information. A small average distance is a proxy for improved synchroniz- 
ability, efficient computation and signal propagation across the network [J]. 
In this section, we will analytically compute the minimum average distance 
of a graph of fixed order and size, and find sufficient conditions in order to 
achieve that minimum. 

Lemma 1. The average distance of a graph is a strictly decreasing function 
of its size. If we start with graph Q = Q{N, m) with average distance D{Q), 
and introduce one additional edge, the new graph Q' = Q'{N,m + 1) will have 
an average distance D{g') < D{g), for N - I < m < {^^) - I. 

Proof. The additional edge will connect two previously non-neighboring ver- 
tices s and t, changing their distance to d'{s,t) = 1. Since they were not 
connected before, their distance was d{s,t) > 2, so d'{s,t) < d{s,t). For ev- 
ery other pair of vertices u and v, the new edge can only create new shortest 
paths, so d'{u,v) < d{u,v). The total average shortest path length of the 
new graph is: 

DiQ') = -7L E d'{u,v)<-^ Yl d{u,v) = D{g). (2) 

□ 

Lemma 2. The star graph is the only tree of order N that has the smallest 
average distance equal to Dgtar — ~ Ji- 

Proof. A tree has exactly — 1 edges among its vertices. There will be 
exactly A^ — 1 pairs of vertices with distance d = 1, and ^) vertex pairs 
that are not connected, with distances d{u, v) > 2. The star graph achieves 
this lower bound, and has the minimum possible average distance. 

Dstar = ^ E ^'("'^) = A(^"^ + ^(^2'^)) 
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It is also unique: If a tree is not a star, there is no vertex that is connected 
to all the remaining vertices. In this case, there are at least two vertices with 
distance d > 3, since in every tree there is a unique path connecting each 
vertex pair, and at the same time the number of neighboring vertices is the 
same as in the star graph. □ 

Using the same method as above, we can find the smallest average dis- 
tance of a graph with vertices and m edges, which we denote as Dmin{N, m). 

Theorem 1. The minimum possible average distance of a graph Q{N,m) is 
equal to D„nn{N,m) = 2- 

Proof. The graph Q{N,m) has m pairs of vertices with distance exactly 1, 
and the rest of the pairs of vertices {u, v) have distances d{u, v) > 2. Conse- 
quently, its average distance is 



'" + 2 ((2) -'") _n '» 
(?) (?) 



This lower bound can always be achieved. A connected graph ^(A^, m) with 
at least one vertex with degree d = N — 1 has the star graph as an induced 
subgraph, so all non- neighboring vertices will have distance equal to 2. All 
connected vertices have distance equal to 1, leading to the lower bound of 
equation (4). □ 

Corollary 1. // a graph Q has at least one vertex pair {u,v) with distance 
d{u,v) > 3, then its average distance is Cg > Dmin{N,m) . 

Proof. The number of pairs with distance 1 is fixed, equal to the graph's size. 
All other vertices have a distance of at least 2, and the minimum is achieved 
when all non- neighboring pairs have distance equal to 2. □ 

The next three corollaries present sufficient conditions for a graph to have 
the smallest average shortest path length. 

Corollary 2. In a network with the smallest average distance, all vertex 
pairs are either connected, or connected to a common third vertex. 

Corollary 3. A cut of a minimum average distance graph Q divides its ver- 
tices into two disjoint sets where, in at least one of the sets, all vertices have 
at least one neighbor in the other. 
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Proof. Assume that in both sets of a graph Q there is at least one vertex 
which has no neighbors to the other set. The distance between these two 
vertices is at least 3, and according to Corollary 1, graph Q will not the 
smallest possible average distance. □ 

Corollary 4. Assume that graph Q of order N has the smallest average 
distance. The average degree cjj^ of the neighbors of vertex u with degree du 
satisfies the inequality 

> (5) 



Proof. Since every vertex u of Q has distance exactly 2 with all its non- 
neighbors, the vertices in its neighbor set Afu = {Vi, . . . ,Vd^} should be 
connected to all the remaining vertices. In other words, all the remaining 
N — 1 — du vertices of the graph should have at least one common neighbor 
with u. Each neighbor of u with degree Qk has Qk — 1 neighbors other than 
u, some of which may belong to A/'^. If we add up the neighbors of all these 
vertices excluding u, we get: 

J2(9k-l)>N-l-du 

k(^Nu (6) 
dnQj,^ >N-l 
N -I 

□ 

Corollary 5. Networks that have the smallest possible average shortest path 
length are disassortative with respect to their degrees. 



3.2 Maximum Average Shortest Path Length 

The networks with the largest average distance have a very different architec- 
ture. They consist of two distinct connected subgraphs, and if we remove any 
edge, the network either becomes disconnected, or the previously connected 
vertices become second neighbors. 
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Lemma 3. Assume that a vertex u with degree du is added to a network, 
with its neighbor set being A4- Rewiring edges of Q such that they connect 
previously non-neighboring vertices in Mu cannot decrease its eccentricity or 
the average distance of u with the other vertices in the network. 

Proof. Connecting any two vertices in Mu will not change the distance of u 
with any of them. Furthermore, disconnecting a pair of vertices, at least one 
of which is not in Mu can only increase the distance of u with any of the 
vertices that do not belong to Mu- □ 

More generally, connecting two non-neighboring vertices has the smallest 
impact on their average distance if they have a common neighbor. Rewiring 
an edge in Q will increase the distance of the initially connected pair 
to d, and decrease the distance of the new pair of vertices (^2,^2) with a 
common neighbor by 1. The overall difference will be 

M{Ui, Vi) + Ad(u2, V2) = {dnew{Ui, Vi) - dold{Ui, Vi)) 

+ {dnew{u2,V2) - dold{u2,V2)) 

=d-2 ^ ^ 

> 0. 

Combining Lemma 3 with equation (7), we can easily see that for a fixed 
neighborhood Mu of a vertex u, we can increase the eccentricity of u and 
at the same time the average distance of the graph it belongs to, simply by 
rewiring edges to connect vertices in Mu, until they form a clique. 

Lemma 4. All connected graphs of order N > 2 and size (^2"^) + 1 < m < 
(^) have the same average distance, equal to 

— Tfl 

D{N,m) = 2-jj^. (8) 
[2) 



Proof. Assume that the largest clique in Q consists of C vertices, that we will 
call central vertices. The rest of the nodes belong to the set V of peripheral 
vertices, with \V\ = P and they form connections to the central vertices or 
among themselves. Since m > (^2"^) ~^ 6very vertex in the graph is either 
a central or a peripheral vertex, and result 

C + P = N. (9) 
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The average distance of equation (8) is equal to the minimum possible dis- 
tance of a graph as in equation (4), and it is achieved if and only if all 
non- neighboring vertices have distance equal to 2. The only way that the 
network will not have an average distance equal to -Dmm is when there is a 
pair of vertices A and B with shortest path length of at least 3. If there exist 
two such vertices, then from equation (4) and Corollary 1 we conclude that 
the maximum average distance of the graph will be 

Dig)>2-^. (10) 

The central vertices are by definition fully connected to each other, and 
any peripheral vertex has distance two with all the central vertices it is not 
connected with. So, the only case where two non-neighboring vertices do not 
have any common neighbors is when both of them are peripheral vertices. 
We will now show that this is not possible. 

For every peripheral vertex u, there are central vertices that are not 
connected to it. Also, let h be the total number of non-neighboring peripheral 
vertices. The total number of non-neighboring vertex pairs is 

7 = h+Y,iu (11) 



with 



< 



= N -2. 

In addition, 

h > 1 (13) 
since A and B are not connected. Combining all the equations above: 

^ 7A + 7B+ Yl 7« < - 3. ^^^^ 



Every peripheral vertex in V has at least one central vertex that it is not 
connected to, so 

7n > 1 \/ueV (15) 

and 

lu>P-'2. (16) 

Based on the last two inequalities combined with inequality (14), we can 
derive an upper bound for the sum of 7^ and 7^: 

1A + 1B<N -P-l 

<N-3 ^ ^ 

because P > 2. But A and B have by assumption no common neighbors in 
the clique or among any peripheral vertices, which means that 

lA + lB>N-2 (18) 

which is clearly a contradiction. □ 

Corollary 6. There are exactly ^^-^ non-isomorphic graphs of order N 
and size m = (^2^^) with the largest possible average distance, equal to 

m — 1 

"(?)" 

All other graphs of the same order and size have the minimum possible aver- 
age distance among their vertices, equal to 

— Tfl 

D^i^{N,m) = 2--^. (20) 
[2) 

Proof. In a graph of size m = (^2^^), the total number of missing edges 
among all the pairs of vertices is 

-,.n-r:^)-iv-i. ,21) 



D^ax{N,m) = 2-^j^. (19) 



.2/ V 2 ^ 

Keeping the same notation as before, we add up all the missing edges among 
the peripheral vertices, and among peripheral and central vertices. 

h + ^A + lB+ Yl lu = N-l (22) 

u^A,u^B 



under the constraints 

1A + 1B>N-P, 5^ 7« > ^ - 2 and h>l. (23) 

These inequahties can only be satisfied in equation (22) if all variables are 
equal to their respective lower bounds, namely 

1A + 1B = N-P, ^7„ = P-2 and h = I. (24) 

MSP 

The only unknown variable above is P. Since A and B are not neighbors, 
and there is only one {h = 1) edge missing among peripheral vertices. If we 
assume that P > 3, then A and B have P — 2 common neighbors, all periph- 
eral vertices that are connected to both of them. This clearly contradicts our 
assumption. So A and B are the only peripheral vertices and P = 2. Such a 
graph is shown in Figure 1(b). It is clear from the previous analysis that 

dA + dB + -fA + lB = 2{N -2) =^ dA + dB = N -2 (25) 

with dA,dB > 1 because the graph is connected. Setting < in order 
to count only non-isomorphic graphs, it is clear that there are exactly ["^^^J 
pairs dA, ds that satisfy the last equation. □ 

Theorem 2. The graph of order N and size N - 1 < m < {^^^) with the 
largest average distance among its vertices consists of a complete subgraph of 
order C , and a path subgraph of order P = N — C . The two subgraphs are 
connected through a edges, as shown in Figure 1(a). In addition, the graph 
with the maximum average shortest path length is unique for N — 1 < m < 

(V) - 1- 

Proof. Every arbitrary cut 5* will produce two disjoint subgraphs, both of 
which need to be maximum distance graphs for the respective orders and 
sizes. More formally, if A is the set of all networks of all orders and sizes 
with the maximum possible average shortest path length and is an induced 
subgraph of a graph Q, then 

g eA ^ g -neA V H^g. (26) 
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(N-2)-Complcte Graph 



(a) (b) 

Figure 1: (a) The graph of order = 12 and size m = 24, with the largest average 
shortest path length. It consists of a complete graph of order C = 6 (blue), and 
a path graph of order P = N — C = 6 (green). Four edges (a = 4) connect the 
complete subgraph to one of the two ends of the path graph, (b) A network with 
size m = (^^^) and largest possible average distance. Vertices A and B are the 
only vertices without any common neighbors, and cIa + ds = N — 2, the number 
of central vertices. 



The above equation is a necessary and sufficient condition for maximum 
average distance. If it does not hold for some subgraph J7 C ^, then we 
would be able to rearrange the edges in it, so that the average distance among 
the vertices in the subgraph is increased. Since this would also increase the 
average distance of Q — J^ with the vertices of JT", the overall average distance 
of Q would increase. 

Now suppose that we want to find the maximum average distance graph 
of order A^. According to the equation above, and setting one of the vertices 
u as the chosen subgraph (of unit order), a graph with order and size m 
has the largest possible average distance (in which case it is denoted Qmax) 
when 



Gmax{N,m) 



arg max 

geciN,7n) 



^ d{u,v) 
(M,t))ev2(g) 



(27) 



where C{N, m) is the set of all possible connected graphs of order and size 
m. But from equation (26), and considering a subgraph H of order 1, we can 
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write the last condition as 



Gmax{N,m) = max g, 



max 



{N -l,m 



\Afu\)unii,Afu) 



(28) 



We will now find the neighborhood JV^ of vertex u in order to yield the 
graph with the largest average distance. We will use induction. For < 4, 
the theorem holds trivially. For order = 4, it is easy to check that graphs 
of all sizes have the structure of the theorem. 

Assume that all the maximum average distance graphs up to order A'o 
and size mo have the same form described above, where 



It will be shown that all networks of order A^ also have that same form, 
making use of equations (26) and (27). If du = 1, then we can connect it to 
the vertex w with the largest average shortest path length. In the resulting 
graph, u will now have the largest eccentricity and average distance to the 
other vertices. At the same time the new graph will have the form stated in 
the theorem and its average distance to other vertices will be 



If the degree of u is equal to the order of the clique, the resulting graph will 
have the largest average distance if we connect it to all the vertices of the 
clique, as shown in Lemma 3. If du is smaller than the order of the clique, 
then u could be connected to clique vertices only, path vertices only, or a 
combination of both. None of the above is an optimal configuration, since 
they do not satisfy condition (26). The same argument holds when du is 
larger than the size of the clique. In this case we can subtract the order of 
the clique C, and consider a new vertex with degree du — C, repeating the 
process if needed. According to the above analysis, the new graph will either 
have the form stated in the theorem, or it will not have the largest average 
distance. 

Finally for graphs with size A^ — 1 < m < (^2^^), the structure that yields 
the largest average distance is unique. Using induction again, we see that 
for A^ = 4, the claim holds. For N > 5, the graph with maximum average 




(29) 




(30) 



i>ev{g) 
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distance is unique for — 1 by the induction hypothesis, and adding one 
extra vertex u with du = 1 or du = C yields the same graph in both cases: 

GmaAN - 1, m - C) U nil, C) = GrnaAN - 1, m - 1) U mi, 1). (31) 

□ 

Note that according to condition (26), the network should have the same 
form no matter which subset of vertices we remove. The form of a graph 
with the largest average distance as stated in Theorem 2 is one that satisfies 
that requirement. 

The networks with the maximum average distance can be described as a 
combination of a type I almost complete subgraph [ ] and a path subgraph. 
Since the only type of almost complete graph in this study is type /, we will 
refer to it simply as an almost complete graph, to avoid confusion. 

We can now summarize the form of the networks with the largest average 
distance for any number of edges. 

Corollary 7. A graph Q{N,m) with the largest average distance consists of 
a clique connected to a path graph as described in Theorem 2 (see Figure 
1(a)) and is unique for N — 1 < m < (^2^) ~ ^- U ^ = (^2"^)' ^hen it 
consists of a complete subgraph of order N — 1 and a vertex with degree 1, or 
a clique of order N — 2 and two peripheral vertices as shown in Figure 1(b). 
If m > (^2^) + 1 then all graphs have the same average distance. 

Corollary 8. Networks with the largest average shortest path length are as- 
sortative with regard to their degrees. 

Note the difference between the networks with the smallest average dis- 
tance and the largest average distance. We can generally say that the average 
distance of a network is an increasing function of its assort at ivity. 

The order of the clique and path subgraphs in a network with the largest 
average distance is computed below. 

Corollary 9. The average shortest path length among the vertices of a net- 
work with the largest possible average distance Gmax{N,m) of order N and 
size m, is equal to 

^„„,,,,„,.(a±m±(g^^)^. ,3.) 

[2) 
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where 



3 + V9 + 8m - SAT 



is the number of vertices that belong to the clique, 

P = N -C 

is the number of vertices of the path subgraph and 

m- P+1- 



(33) 



(34) 



(35) 



is the number of edges that connect the clique with the path graph. 



Proof. We will find the lengths of the shortest paths among all vertices, add 
them, and finally divide them by their number to find the average. First, we 
need to find the order of the chque. Adding up all the edges of the network, 
we have 

a + (P - 1) = m. (36) 



Replacing P (total number of vertices is C + P = A^) , we get 

+ a+{N -C -1) 

where C and a are integers satisfying the inequalities 

1<C,P < N -1 

and 

l<a<C-l 

respectively. Solving for C: 

- 3C +(2N -2m + 2- 2a) 



0. 



(37) 

(38) 
(39) 

(40) 



One way to find the solution of the second order equation above, is to set a 
equal to its smallest possible value, and solve for C, keeping in mind that it 
is always a positive integer. As we add more edges, a is increasing with C 
staying unchanged, until the vertex of the path subgraph is connected to all 
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the vertices of the chque. At this point, C increases by one and a changes 
from C — 1 to 1. We set a = 1, and taking into account that C G N*, 



C 



3 + V9 + 8m - 8A^ 



(41) 



We can now compute the number of the vertices that do not belong to the 
chque, and the number of edges between the two subgraphs a from equation 
(36). 

The distance among each pair of the C vertices of the clique is 1, so the 
sum of the pairwise distances is 



(42) 



The sum of the shortest path lengths of the path subgraph vertices to the 
clique vertices is 



D2 = ^[xa + {x+l){C- a)] = ^[(C - a) + xC] 



x=l 



x=l 



P{C-a) + C 



P + 1 



(43) 



Finally, the sum of the shortest path lengths of nodes of the path subgraph 
is 

P X P x-l P / \ /Pii^ 

«3 = EEfe-) = EE-5:(;) = ( 3 

x=l y=l x=l z=0 x=l ^ ^ 

Adding all the sums of all the shortest path lengths, and dividing by the 
total number of vertex pairs, we get 



(44) 



Dmax{N, m) 



(45) 



□ 



It is easy to show that when m > (^2^^), the formula for the minimum 
and maximum average distance give the same result for the average distance, 
in accordance with Lemma 4. In that case, equation (32) assumes that the 
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200 400 600 

Network Size 



Figure 2: Tight bounds on the average distance of a graph with = 40 vertices 
and 39 < m < 780 edges. These bounds have been computed analytically. The 
average shortest path length for random graphs has been estimated by finding 
the mean shortest path length of 10^ randomly generated graphs of the same 
order and size. The expected average distance of a random graph is very close to 
the minimum, even for relatively sparse networks. For graphs with edge density 
p > 0.25, it is virtually identical to the minimum one. 



network is an almost complete graph, but this graph has the same average 
distance as any other graph of the same order and size. 

An example that shows the tight upper and lower bounds of the average 
distance of a graph with = 40 and 39 < m < 780 vertices is shown in 
Figure 2. 

4 Betweenness Centrality 

The betweenness centrality of a vertex or an edge is a measure of how im- 
portant this vertex or edge is for the communication among the different 
parts of the network. It is based on counting the number of shortest paths 
among all pairs of vertices a given vertex or edge is a part of [S]. The vertex 
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betweenness centrality is defined as 

where agt is the number of shortest paths between vertices s and t and o"st(u) 
is the number of shortest paths between s and t that go through vertex u. 
What equation (46) computes is the total number of shortest paths of all the 
pairs of vertices in the graph that go through a given vertex u. If there is 
more than one such path, we divide by their number, since they are assumed 
to be equally important. The betweenness centrality of a vertex is sometimes 
normalized by the total number of all vertex pairs that we took into account 
for computing it, which is equal to (^2^^)- 

'~>norm{U) — / j ' \^ ' / 

\ 2 ) (s,t)eV2(g) 

The vertex betweenness is always nonnegative. The only vertices with be- 
tweenness centrality equal to zero are the ones with degree equal to 1. In 
order to network, we find the average for all vertices: 

Networks with a large betweenness centrality usually have few vertices that 
play a major role in the communications among every other vertex. Con- 
versely, a small betweenness centrality indicates that all vertices are equally 
important or there are many different shortest paths among the various parts 
of the network. 

The edge betweenness centrality is similarly defined as the sum of the 
fraction of shortest paths of all vertex pairs in the network that go through 
a given edge: 

B{f)= Yl ^ (49) 

where in this case (Jst{f) is the number of shortest paths between s and t 
that go through edge /. The edge betweenness centrality of the network is 
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defined in the same manner as before: 



1 



(50) 



m 



The betweenness of an edge is always positive for a connected network. 

The betweenness ccntrahty of a graph is an important proxy of how robust 
the network is to random vertex or edge removals. Removing a vertex or an 
edge with large betweenness centrality means that the communication among 
many vertex pairs will be affected, since they will now be forced to exchange 
information through alternative, possibly longer paths. Graphs with large 
betweenness centralities are sensitive to random removal of a set of vertices 
or edges. The vertex or edge betweenness centrality of a graph does not 
give any information about the centralities of different vertices or edges, 
which may have large variation among each other. For networks with the 
same betweenness centrality, large variations among vertices or edges reveal 
a sensitivity to targeted attacks, since removing the most central vertices may 
significantly disrupt the network function. In this section we show that the 
betweenness centrality of a graph is inherently related to its average shortest 
path length. 

Theorem 3. The average betweenness centrality of a network Q{N,m) is a 
linear function of its average distance, 



{N - i){D{g) - 1) 



(51) 



2 
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Proof. 



9/V 



~ 2N ^ ^ ~st ^ 

itev(g) sev(g) tev{g) sev(g)tev(g) MeV(g) 

t^S Uj^t 

^ E E ^''«(i^(^-')i-i) = 2]v 5: E W'.') 

sev(g)tev{g) seV(g)tGV{g) 



1 

2N 



(52) 

Simplifying the last equation, the average betweenness centrality of a graph 
becomes the one stated in the theorem. □ 

It is worth mentioning that the average betweenness centrality of the 
network is only dependent on its size indirectly, through the average distance 
of the graph. For a fixed order, the average betweenness centrality of a 
network decreases as we add new edges (see Lemma 1). 

Corollary 10. A network has minimum (maximum) average betweenness 
centrality if and only if it has minimum (maximum) average distance. The 
minimum possible average betweenness centrality of a graph of order N and 
size m is equal to 

N — 1 m 

B^UG) = - ^ (53) 

and the maximum possible average betweenness centrality of such a graph is 

where C,P and a are defined in equations (33), (34) and (35) respectively. 

Proof. The networks with the smallest or largest average betweenness cen- 
trality are exactly the graphs with the smallest or largest average distance 
respectively. Replacing them from equations (4) and (32), the bounds for 
the average betweenness centrality of graphs follow. □ 
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Corollary 11. The minimum sum of betweenness centralities of all the ver- 
tices of a network is equal to the number of vertices that are not neighbors. 

Proof. From equations (48) and (53), we see that 



mm 

g&C{N,m) 



ueViG) 



N ■ BrrUG) = ( ^ ) - ^- (55) 



2 



□ 



Theorem 4. The average edge betweenness centrality of a network is directly 
proportional to the average distance of the network, equal to 

The minimum and maximum average edge betweenness centrality of a net- 
work of order N and size m are respectively 

BLniN,m) = ^^^-^^ -1 (57) 
m 

and 

m 

where C,P and a are the same as in equations (33), (34) and (35). 

Proof. We follow the same method as in the proof of the vertex betweenness 
centrality: 

Se(^) = - E ^(-) = - E E ^ 

TTl Tfl (J i 

{s,i)GV2(g)ee£-(g) {s,t)ev2(g) ee£-(g) (59) 



— —astd{s,t) 

m < ^ rr . 



{s,t)ev2(g) 

Replacing the average distance by its minimum and maximum bounds, we 
get equations (57) and (58) respectively. □ 
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5 Efficiency 



The efficiency of a network (as defined in ['>]) is a metric that shows how fast 
a signal travels on average in the network, assuming constant speed from one 
vertex to another. It is the sum of the inverse distances of all vertex pairs in 
a network, normalized by the total number of such pairs: 

Network efficiency is also correlated with the fault tolerance of the net- 
work, in the sense of how the average distance of a network changes when 
one or more vertices are removed from the network. It is has been used to 
assess the quality of neural, communication and transportation networks [5]. 

Below we are going to show that the most and least efficient networks 
are the ones with the smallest and largest average distance among their 
individual parts. 

Theorem 5. A graph Q = Q{N, m) has the highest efficiency among all other 
graphs with the same order and size if and only if it is a graph of minimum 
average distance. The highest efficiency of a network of N vertices and m 
edges is equal to 

1 m 

'^-(^'-) = 2 + iV(iV^- (^1) 

Proof. We assign a distance matrix to every graph, with its (fc, m) element 
being the distance between vertices k and m. For a graph Q = QN,m with 
distance matrix D and the minimum average distance, the sum of all the 
distances among all the pairs of vertices is smaller or equal to that of any 
other random graph TZ = TZN,m with distance matrix H. 

dkm < hkm- (62) 

k<m k<m 

The function to be maximized is convex, which means that the maximum lies 
on one of the boundaries. Since we will be comparing only networks of the 
same order, we will focus on the sum of inverse distances among the vertices 
of each network. 

^\<3)={T]H<3)- (63) 
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If a network TZ is not a minimum average distance graph, then according to 
Corollary 1 there exists at least one pair of vertices (a, b) with d{a, b) > 3. 
The sum of the inverse shortest path lengths of such a network is 



u^v (64) 
+ ^l^2(7^)| + $^i|^,(7^)| 



k>3 

On the other hand, the sum of the inverse distances of a minimum average 
distance network Q is 

^-^y ™ (65) 

The difference is therefore 
T'iS) - T'CR.) = Qm+ i(^)) - U + + g 



■m - |£:2(7e)| 



6 

A;>3 

> 0. 

(66) 

This shows that a maximum efficiency graph is a minimum distance graph. 
Normalizing by the total number of vertex pairs, equation (61) follows. □ 

Theorem 6. A network has the lowest possible efficiency if and only if it is 
a largest average distance graph. 
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Proof. We will use the same method as in the proof for the form of networks 
with the largest average distance. When {^-^) + 1 < m < (^) , then all 
networks have m pairs of connected vertices, (^) — m pairs of vertices that 
are second neighbors, and there is no graph in which two vertices do not have 
any common neighbors, as shown in Lemma 4. This clearly shows that all 
networks of this size have the same efficiency, given by equation (61). For 
smaller size graphs, when A^— 1 < m < (^2"^)' ^ necessary and sufficient 
condition will be 

g ^ g -Uel V ncg. (67) 

with I being the set of networks with the lowest efficiency. If we consider a 
subgraph of order 1 (a single vertex), its average distance to all other vertices 
will be the largest when its degree is equal to 1. So, if ^ = "H U {u}, with 
T-i &X and u is only connected to one other vertex in the graph (its distance 
to which is equal to 1), it is evident that it has to be connected to one of the 
vertices with the largest average distance, which at the same time has the 
largest eccentricity. 

fcev(g) 

(68) 



The last equation shows that if v is the vertex of "H with degree d^, = 1, then 
the new graph has the smallest possible efficiency. □ 



6 Radius and Diameter 

The radius and the diameter of a graph are also measures that have to do 
with distance. In order to define the radius of a graph, we need to find a 
central vertex in the network, the one that is the closest to all other vertices. 
A network may have more than one central vertex. We are often interested 
in the radius of a network when information is aggregated and distributed 
from a vertex high in the hierarchy to other vertices lower in the hierarchy. 
The importance of a node is correlated with how central it is. Important 
vertices are usually the ones closest to the network center. 



23 



On the other hand, the diameter of a network becomes important when 

we have a flat hierarchy, where communication or signal propagation takes 
place with the same frequency among any given pair of vertices in the net- 
work. There are applications in which we want our network to have very 
small or very large diameter. Usually for signal propagation or in general 
diffusion phenomena, the desired network architecture has the smallest pos- 
sible diameter, since the response to different inputs needs to be processed as 
fast as possible. When considering a virus spreading in the network during 
a fixed time interval, in order to ensure that as few nodes as possible get 
infected before appropriate action is taken, the network diameter has to be 
as large as possible. 

Here, we are going to show the structure of the networks with the largest 
and smallest radius and diameter. As we will see below, these graphs do not 
always have the same form. 

6.1 Networks with the Smallest and Largest Radius 

In this section, we will find tight bounds for the radius of graphs of arbitrary 
order and size. The networks that achieve these bounds are not generally 
unique. The radius of a network is correlated with its average distance and 
diameter. Graphs with the smallest radius have the smallest average distance 
and smallest diameter, whereas graphs with the largest radius may or may 
not have the largest average distance or diameter, as we will see next. 

Lemma 5. If {u,v) e S{Q), then 

ecc{u) — 1 < ecc{v) < ecc{u) + 1. (69) 



Proof. For every vertices u,v such that {u,v) G £{Q)-i and w e V(^), 

\d{v, w) — d{u, w)\ < 1 
=^ d{u, w) — 1 < d{v, w) < d{u, w) + 1 

=^ max \d(u,w) — 1} < max d(v,w) < max \d(u,w) + 1} ^'^^^ 
wev{g) wev{g) wev{g) 

=^ ecc{u) — 1 < ecc{v) < ecc{u) + 1. 

□ 
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Theorem 7. A network of order N and size m has the smallest possible 
radius if and only if it has an induced subgraph which is the star graph. Such 
a network has a radius equal to one, regardless of its size. 

Proof. The radius R{Q) of any graph ^ is a natural number, with R{Q) > 1. 
If a star of the same order as Q is an induced subgraph, then the central vertex 
has eccentricity equal to one, which is the minimum possible. Conversely, 
if the radius is equal to one, then there exists at least one vertex with full 
degree, which, along with its neighbors forms a star subgraph. □ 

Corollary 12. A network with the smallest radius also has the smallest 
average distance among its vertices. The opposite is not necessarily true, 
since there exist minimum average distance networks with no vertices of full 
degree (Corollaries 2-4)- 

Lemma 6. The maximum radius of a graph Q{N,m) is a nonincreasing 
function with respect to the size m. 

Proof. Adding an edge to any graph Q {N, m) will create a shorter path be- 
tween at least two vertices, so the eccentricity of every vertex in Q is either 
unchanged or decreases. □ 

Lemma 7. Assume that Q{N,m) has radius R{G) = r, and c is a central 
vertex. If d{a, c) = r for some a G V{Q), then there exists a vertex b G V(^) 
such that 

d{b,c)>r-l and 'P{a,c) nV{b,c) = c. (71) 



Proof. Suppose that there does not exist such a vertex. If the first condition 
is not satisfied, then 

d{u, c) < r - 2 \/ue V{g) =^ R{g) < r - 1 (72) 

which contradicts the assumption that Q has radius r. 

If there do not exist any vertices a and b with distances at least r and 
r — 1 respectively from c whose shortest paths to c have no other common 
vertex, then there exists a different vertex w G V{a, c) fl V{b, c) such that 

d{a,w)<r — l and d{b,w)<r — 2 (73) 

meaning that c is not a central vertex. □ 
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Lemma 8. A path graph has a radius larger or equal to any other tree net- 
work, 

N 

Rmacc{N,m = N-l)=\^j^. (74) 

A cycle graph has radius larger or equal to any other network, Rmax{N, m = 
N)=RmaAN,m = N-l). 

Proof. A network Q with radius R > [yj + 1, according to Lemma 7 will 
need to have an order of 



\v{g)\ > 1 + 1 + 



N 
~2. 



+ 



N 
~2. 



>N + 1 



(75) 



which is a contradiction. If the path graph has an odd number of vertices, 



the central vertex is the middle vertex, with distance 



N-l 



from both extreme 



vertices. If the order is even, then both middle vertices are graph centers. 



and their eccentricities is equal to Connecting the two vertices that are 
furthest from the center through an edge does not have an impact to the 
graph radius, so a cycle has the largest possible radius (Lemma 6). Because 
of the symmetry of the network, all vertices have the same eccentricity. □ 

Lemma 9. A graph of order N and size (^) — [^1 + 1 < "w- < (^) has 
radius equal to 1. 

Proof. It suffices to prove that there exists at least one vertex with full degree. 
If all vertices have degree less than N — 2, the graph size is at most 



m < rriQ 



iV(iV-2) 



(76) 



which is not possible, since for all > 2 



+ 1 > rriQ. 



(77) 



□ 



Lemma 10. The largest possible radius for a graph of order N and size 



is equal to 2. 



26 



Proof. It suffices to prove that graphs with size m > (^^^) + 1 cannot have 
radius of 3 or larger, since we can find at least one network of this size in 
which no vertex has full degree [10]. Refering to Figure 3(a), let C be one of 
the central vertices. According to Lemma 7, there exist at least two vertices 
A and F (possibly connected to each other) such that 

d{C, A)>2 and d{C, F) > 3. (78) 

In order to respect these distance conditions and the centrality of C, 

C-/-F, B-/-E and A-/-D. (79) 

Because d{C, F) > 3, nodes C and F cannot have a common neighbor, so 
there are N — 2 edges that are not present in the graph. In addition, B and 
E may not have a common neighbor either, otherwise the radius would be 
at most equal to 2. There are another — 4 edges that cannot be present, 
since there are N — 2 possible common neighbors of B and E, and we have 
already counted two of them in the previous case. So the graph is missing at 
least rris = (A^ — 2) + (A^ — 4) + 3 edges, and its size is at most 

m<(^)- (2JV - 3) 

N- 2^ f^"' 



2 



□ 



Lemma 11. The maximum possible radius of a graph of order N and size 
N+l<m< (\^) -2 IS 



Rmax{N,m) 



2N -1- VI + 8m - 8A^ 



Proof. For every A^ > 3, there is at least one graph with radius equal to 
Rmax{N,m) that includes a full circle as an induced subgraph. To see why, 
suppose that C is a central vertex and 



d{C,A) = R^acc and d{C, F) > R^,, - 1 A,FeV{g) 



(82) 
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(a) (b) (c) 

Figure 3: (a) Necessary induced subgraph for a network to have radius larger or 
equal to 3. The vertices shown are not allowed to have any direct connections, 
and vertices with distance 3 are not allowed to have any common neighbors, (b) 
For order N and size m > N, there is always at least one graph ^(A^, m) that has 
the maximum possible radius, and has a full cycle as an induced subgraph, (c) 
A network with largest possible radius with N = 7 and m = 11. It consists of an 
almost complete subgraph of order M = 5 and a path graph of order L = 4. The 
two induced subgraphs share two vertices, so that N = L + M — 2. The red vertex 
has the largest degree, and is the center of the network. 

as shown in Figure 3(b). We pick vertices B and D such that 

d{C, B) = 1 d{C, D) = 1, and B G V{C, A),G e V{C, F). (83) 

Also, all other vertices can be connected to B, D and C without changing 
the radius, when m < (^^^) — 2. Vertices A and F can be connected without 
changing the maximum radius of the network, as shown in Lemma 8. Also, 
there are no edges among vertices in V{C, A) or vertices that belong to 
V{C,A), otherwise condition (82) would not be satisfied. Thus, the network 
described has a full circle as an induced subgraph. 

We now need to compute the maximum radius of such a graph. Since 
a new edge always creates new shortest paths, we have to connect vertices 
with distance equal to two, such that we only create a single new shortest 
path between vertices that are second neighbors (see also equation (7)). In 
other words, a simple method to find the graph with the largest possible 
radius is to start from a cycle graph, and keep adding edges such that we 
have a complete or almost complete graph connected with both ends of a 
path graph, as shown in Figure 3(c). This process is the same as the one 
for finding networks that minimize crosstalk among individual elements [9], 
with the only difference being that the initial graph is a cycle instead of a 
star. Based on the symmetry of this type of network, we can assume that 
the vertex with the largest degree is always a central vertex. If we denote 
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with M the order of the complete or almost complete graph, and with L the 
order of the path graph, we can find these orders by solving the following 
system of equations: 



m = (L - 1) + 
L + M = N + 2 



M -1 
2 



a 



with 



l<a<M-l, L,CgM*. 



(84) 



(85) 



We compute M (and subsequently L) in the same way as in the proof of 
Corollary 9, by setting a equal to its maximum value, and then choosing 
the smallest integer M that is smaller than the solution of the second order 
equation. 



M 



3 + VI + 8m - 8iV 



L = N + 2- 



3 + VI + 8m - 8N 



(86) 



This graph has radius equal to 

L + 6{a<M -1) 



Rmax{N, m) 



N + 2 + 6{a < M -1) - [ 



3+Vl+8m-8N 



(87) 



where S is the Kronecker delta function. After simplifying, the last expression 
becomes: 

^2N - 1 - VI + 8m - 8N 



RmaxiN.m) 



(88) 



□ 



Theorem 8. The maximum radius of a network of order N and size m is 



Rmax{N,m) = < 



-I 

2N-1-Vl+Sm-8N 
4 



2 
1 



- 1 < m < 
A^ + 1 < m < (^2~^) 
(--) + 1 < ^ < (-) - [f ] 
(-)-[fl+l<m<(^). 



(89) 
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An example of the form of the function above for = 40 and all sizes is 
shown in Figure 4. Note the same stair-like form of both the maximum radius 
and the statistical averages. The statistical average curve exhibits fewer and 
smoother "steps". Networks with the largest radius of order N = 5 and all 
sizes are listed in Figure 5. 




200 400 600 

Network Size 



Figure 4: The largest, smallest and average radius of graphs of order = 40 as 
a function their size 39 < m < 780. The minimum and maximum bounds are 
analytically computed. The statistical averages are estimated by the mean radius 
of 10^ random graphs for each size. The average radius also has a "stepwise" form. 
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(f) m=9 (g) m=10 



Figure 5: All connected networks with 5 vertices and 4 < m < 10 edges with the 
largest possible radius. The first graph of each group is constructed by the method 
described in the text. 

6.2 Networks with the Smallest and Largest Diameter 

In this subsection, we are going to study the form of the networks with the 
minimum and maximum diameter. Computing the minimum diameter of 
a network is fairly straightforward. In the case of the maximum possible 
diameter, we first prove two lemmas that will help us show that we can find 
the structure of the networks recursively. 

Theorem 9. A network has the smallest possible diameter if and only if it 
is a smallest average distance graph. 

Proof. The diameter of a complete graph is trivially equal to one. If the graph 
is not complete, the diameter is at least 2, since there is at least one pair 
of non-neighboring vertices. In a graph with the smallest average distance, 
all vertices that are not connected have at least one common neighbor, and 
the maximum eccentricity is equal to 2. Conversely, if the largest distance 
among any vertex pair is equal to 2, then by Corollary 1, the graph has the 
smallest average distance. □ 
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Corollary 13. A network with the minimum radius (R{Q) = 1 j also has 
minimum diameter (T{Q) = 2) regardless of its interconnection topology. 
The inverse is not always true: There are networks with minimum diameter, 
and radius R{Q) = 2 > Rmin{N,m). 

Lemma 12. A network of order N and size (\^) + l<m<{^^)-lhas 
a diameter equal to 2. A complete graph has diameter equal to 1. 

Proof. In a complete graph, all vertices are connected to each other, so the 
eccentricity of every vertex is trivially equal to 1. In a graph of size m > 
(^2^) +1, all vertices that do not share an edge have at least one common 
neighbor, as shown in the proof of Lemma 4. Consequently, every vertex has 
eccentricity either 1 or 2, so the diameter is equal to 2 regardless of the graph 
topology. □ 

Lemma 13. The largest possible diameter T^axiN, m) of a network of order 
N is at most one larger than the largest possible diameter of a network with 
order N — 1 and smaller size. 

TmaxiN, m) <T^ax{N -l,m-d) + l, l<d<N-l. (90) 



Proof. Assume that the graph Q = QN,m has diameter T{Q) < Tmax{N,m). 
Define as V the set of unordered vertex pairs whose distance is equal to the 
graph diameter. We now remove an arbitrary vertex u with degree d = du 
from Q, and the resulting graph is H with order — 1. If u G P, then no 
shortest path between any vertex pair in V passes through u. We distinguish 
two cases: 

• If M is in every vertex pair in V, then the diameter of "H is 

Tig) = Tin) + i yg,n 

=^ T^axiN, m) < TrnaxiN - 1, m - rf) + 1. ^ ' 

• If there exists at least one vertex pair in P that does not include u, 
then removing u will result in a graph H that has the same diameter 
as Q. 

T{G) = Tin) < T^UN - l,m - d) ^^^^ 

=^ Tmax{N, ni) < Tmax{N - 1, m - rf). 



32 



Combining the two cases, the result follows. □ 

Corollary 14. // we remove a vertex u from a graph Q , resulting to graph 
Ti, then 

Tn=Tg-l =^ Vg = {{u, wi), (m, W2),..., (u, Wd)}. (93) 
Conversely, if we add a vertex u with degree d to H, then 

Tg = T-u + l =^ {{vi,wi),{v2,W2),. . . ,{yd,Wd)} ^V-u (94) 
with wi, . . . ,Wd,vi, . . .Vd e V{'H),V{Q) and {u,vi), . . ., {u,Vd) G S{Q). 

Theorem 10. The largest possible diameter of a network of order N and 
size m is equal to 



T^axiN,m)=N 



1 + V9 + 8m - 8A^ 



(95) 



Proof. Lemma 13 and Corollary 14 readily show an easy way to find the 
largest possible diameter of a graph of fixed order and size. According to 
Corollary 14, if we add a vertex u with degree (i„ = 1 to a maximum diameter 
graph "H, and we connect it to a vertex with the largest eccentricity, the 
resulting graph Q has also the largest diameter for its order and size (see also 
Lemma 5). As a result, we can write the following recursive relation: 

T^,,(iV,m) = l + T^,,(Ar-l,m-l). (96) 

Repeating the process as many times as possible, 

TmaxiN,m) = k + Tmax{N - k,m- k). (97) 

The only reason why we cannot continue the recursion relations further is 
that there cannot exist a network with N — k — 1 vertices and m — k — 1 
edges, simply because 

m-k-l> (98) 

This means that the subgraph "H with N — k vertices and order m — k has 
size 



2 < < ( 2 ) 
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0^ 

200 400 600 

Network Size 

Figure 6: The smallest, average and largest diameter of graphs of order = 40 
as a function of their size, 39 < m < 780. The minimum and maximum bounds 
are analytically computed. The statistical average is approximated by the mean 
diameter of 10^ random graphs of the respective size. Note that even in the average 
diameter, there is the "stepwise" form of the maximum diameter graphs, although 
much smoother. 



and consequently has a diameter of 2, so the maximum diameter of the graph 
is: 

Tmax{N,m) = k + 2 (100) 

where k = N — N-^. One of the graphs with the maximum diameter consists 
of a path graph with N — N^i vertices, and a subgraph of order N-^ and size 
> (^^~^) +1. If we assume without loss of generality that "H is a type / 
almost complete graph, then ^ is a maximum distance network. It consists 
of a path graph of order P = k + 1 and a complete graph of order C, such 
that P + C = A^, as shown in Figure 1(a). If we denote by C and P the 
order of the clique and the path subgraphs as in equations (33) and (34), 
and combine them with equation (100), the maximum diameter of a graph 
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^(A^, m) is 



Tmax{N, m) = k + 2 = P+l = N -C + 1 
= N- 



1 + V9 + 8m - 8A^ 



(101) 



□ 



Corollary 15. A maximum average distance graph also has the largest pos- 
sible diameter. The converse is not always true. 

An example of the minimum, maximum and average diameter of graphs 
with 40 vertices and increasing number of edges is shown in Figure 6. 

From the previous analysis we can conclude that the only tree with the 
largest diameter is the path graph. A graph with the largest diameter is not 
necessarily unique for any size < m < (^) - 2. The list of all the graphs 
of order N = 5 and size 4 < m < 10 are shown in Figure 7. 

Comparing the graphs with the largest radius with the graphs with the 
largest diameter, we find the following counterintuitive result: 

Corollary 16. A network with the largest diameter does not necessarily have 
the largest radius. Conversely, a network with the largest radius does not 
necessarily have the largest diameter. 

An example is given in Figure 8. 
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(f) m=9 (g) m=10 



Figure 7: All connected networks with 5 vertices, 4 < m < 10 edges and the largest 
diameter. The first graph of each group is also a maximum average distance graph. 




(a) (b) 

Figure 8: (a) The only graph with {N, m) = (6, 6) and radius Rmaxi'^-, 6) = 3. (b) 
All graphs with the same order and size and maximum diameter, equal to 4. No 
graph of the largest diameter coincides with the maximum radius network. 

7 Resistance Distance 

The resistance distance between two vertices of a graph is equal to the to- 
tal resistance between two points of an electrical network, with each edge 
representing a unit resistance. 

Theorem 11. The smallest possible resistance distance of a simple connected 
graph Q = QN,m is 

r, ^.r ^ f — WT^ N-l<m<2N-3 
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Proof, lim — N — 1,Q is & tree, and it is not possible to have any resistors 
connected in parallel, so S„iin{N,N — 1) = 1- Every tree is a minimum resis- 
tance distance graph as long as the endpoints of the circuit are two adjacent 
vertices. When N < m < 2N — 3, the network will have c = m — N + 1 
independent cycles. In order to make the resistance as small as possible, we 
need to choose the endpoints of the circuit to be two adjacent nodes, and in 
addition to be parts of cycles as short as possible. This is because the smaller 
the resistance of each branch of the cycle, the smaller the resistance between 
the two endpoints. Since a cycle has at least 3 vertices (two vertices cannot 
be connected with more than one edge, since the graphs are assumed to be 
simple), the cycles need to be of length 3, and all of them have a common 
edge, the endpoints of which are the endpoints of the circuit. As a result, 
the total resistance will be the combination of a unit resistor in parallel to 
m — N + 1 pairs of resistors connected in parallel, which means that 

SminiN,m)^ ^ = 1— -, N-l<m<2N-3. 

l + ^{m — N+l) m — N + S 

(103) 

Adding more edges to the network has no effect on the total impedance, since 
all vertices except for the circuit endpoints will have the same potential, equal 
to half of the voltage difference applied to the circuit ends (assuming that 
we have arbitrarily set the potential of one of the endpoints equal to zero). 
Consequently, no current would flow among them, and we can write 

1 2 

SminiN, m) = T- ^ = — , m>2N -3. (104) 

^ ' ' 1 + ^{{2N -3) - N + 1) TV' - ^ ^ 

□ 

Corollary 17. A graph with size N — 1 < m < 2N — 3 has minimum 
resistance distance if and only if it has a subgraph consisting of m — N + 1 
triangles that have one edge in common. The endpoints of the common edge 
are the endpoints of the circuit. For 2N — 2 < m < (^) , any graph with at 
least two vertices of full degree is a minimum resistance distance graph. 

We now turn our attention to the structure of the networks with the 
largest resistance distance. We will start from networks of relatively large 
size (almost complete graphs) and then move on to finding their form in the 
general case. 
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Lemma 14. The maximum resistance of a network of fixed order is a de- 
creasing function of its size. 

SmaxiN, m) > SmaxiN, 171 + g) for g > (105) 



Proof. If Smax{N, m) < Smax{N,m + g), we remove g edges from the network 
of larger size. This cannot decrease the total resistance, since there is now no 
voltage drop between the vertices that were previously connected. Therefore, 
the resulting graph will have total resistance larger than the resistance of the 
initial network of smaller size, which contradicts the hypothesis. □ 

Lemma 15. The largest possible resistance distance of a network with m > 

iV) + 1 

SmaxiN, m) = ^^-^^ (106) 

where 

C = N ~ 1 and a = m — 

Proof. The almost complete graph will have the largest possible resistance 
when the endpoints of the circuit are the peripheral vertex and a central 
vertex it is not connected to. The reason is that any other graph will have 
more shorter paths to the ground vertex, and thus smaller total resistance 
[11]. In the almost complete graph, because of the symmetry, all the vertices 
that are neighbors of the peripheral vertex have the same potential. Similarly, 
all vertices that are not neighbors of the peripheral vertex, except for the 
ground vertex also have the same potential. The ground vertex has zero 
potential. Thus, we may remove the edges between the a neighbors of the 
peripheral vertex, and the N — 2 — a non-neighbors of the peripheral vertex, 
excluding the ground vertex. Then, we merge the edges that connect these 
three sets of vertices, as if they were connected in parallel. The result is 
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shown in Figure 9. The total resistance of the transformed network is 




(a) (b) (c) 

Figure 9: (a) An almost complete graph has the largest resistance distance among 
all other graphs of the same order and size. The endpoints with that resistance are 
the peripheral vertex (green) and an arbitrary central vertex that is not connected 
to it (red). The central vertices that the peripheral vertex connects to (blue), all 
have the same potential. All central vertices that are not connected to the periph- 
eral vertex (yellow) also have the same potential, (b) Taking into consideration 
the symmetry of the circuit, we can remove the edges among vertices that belong 
to the same group, (c) The previous circuit can be simplified by collapsing all 
nodes of one group to one "super-node" , and considering all resistors that connect 
different groups to be connected in parallel. The weights of the edges correspond 
to the resistances among the "super-nodes" . 

Lemma 16. The maximum resistance of a network Q{N,m) can be at most 
one larger than the maximum resistance Smax{N — l,m — 1), provided that 
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both networks are connected. More precisely 

SmaAN,m)<S^aAN-l,m-l) + l. (109) 



Proof. We start from the network 'H(A^— 1, m — 1) with the largest resistance, 
and add one extra vertex with unit degree to one of its endpoint vertices. If 
the Lemma does not hold, then the resulting graph Q = Q{N,m) will have 
resistance 

SiG) = 1 + Sma^N - 1, m - 1) < SrnaAN, m). (110) 

Now assume that the network }C with the maximum resistance has vertex 
u with degree du as an endpoint. li du = 1, then network T-L was not a 
maximum resistance network and the Lemma is proved. If (i„ > 1, then the 
potential difference between u and all its neighbors will be smaller or equal 
to one, since the current flowing through u will be divided among all the 
resistors that are adjacent to u. Consequently, removing vertex u along with 
its edges, we can find a relation with the resulting network Ai 

Smax{N,m) <e + S{M) < e + 5^a,(iV - 1, m - rf„) (111) 
with < e < 1. Combining equations (110) and (111), it is evident that 

S^aAN - 1, m - 1) < SmaAN - 1 , m - - (1 - c) (112) 

which, according to Lemma 14, is not possible. □ 

Theorem 12. The largest possible resistance distance of a simple connected 
graph Q = QN,m is 

Sma.{N, m) = P-l + ^^^^^ (113) 

where C,P and a are defined in equations (33), (34) and (35) respectively. 

Proof. Using the Lemmas above, we can analytically compute the largest 
resistance, and find the form of the graphs that have it. We repeatedly apply 
Lemma 16, until we are left with an almost complete graph, at which point 
we can make use of Lemma 15. 

Smax{N,m)<k + S„,aAN-k,m-k). (114) 



40 



We can achieve the equahty in the last equation by connecting a new vertex 
with unit degree to one of the endpoints of the previous graph, such that 

k = P-l and N-k = C+l (115) 

where the total resistance is the sum of a path graph with P vertices, and 
an almost complete graph with the respective values of C and a (see also 
condition (98)). The form of these maximum resistance networks are easy 
to identify: We recursively connect in series a unit resistor in one of the two 
endpoints of the previous circuit. □ 

An example showing the minimum, average and maximum resistance for 
networks of order = 40 and increasing size is shown in Figure 10. 
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200 400 600 

Network Size 



Figure 10: The largest, smallest and average resistance of graphs of order = 40 
and size 39 < m < 780. The minimum and maximum bounds are analytically 
computed. The average resistance of each graph is approximated by the mean of 
10^ random graphs of the respective size. The statistical average of a network of 
relatively small size is very close to the minimum resistance. 
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Corollary 18. The number of non-isomorphic graphs with the largest resis- 
tance distance is 



Is (N, m) 



P + l 
P+i 



a = 1 or a = C 
2<a<C -2 



(116) 



where C,P and a are defined in equations (33), (34) and (35) respectively. 

Proof. The order in which we place linear resistors in series has no effect on 
the total resistance of the circuit. We can serially place ni and n2 resistors 
at each side of the almost complete subgraph, such that 



rii + n2 = P with rii > and n2 > 0. 



;ii7) 



The last equation has P+l solutions. When a = lora = C— 1, then 
the almost complete graph is symmetric, which means that we count every 
non-isomorphic graph twice. Adjusting for this special case the number of 
non-isomorphic graphs, we get the desired result. □ 

Corollary 19. The networks with the largest average distance have the largest 
resistance distance. The converse does not always hold. 



8 Sensitivity to Rewiring 

In most cases when designing a network, there are many constraints that 
need to be satisfied, and the properties discussed so far are only proxies to 
determining other desirable qualities of a network. Also it is reasonable to 
expect that our networks may not be allowed to have the extremal values of 
the properties described in the previous sections, especially given that some- 
times there are conflicting requirements for the network function. Under 
these considerations, we are interested in knowing how robust these struc- 
tures are, in other words, how sensitive these properties are to changes in the 
interconnection patterns. 

To this end, we can start with two networks with the extremal prop- 
erties, and keep rewiring one of the edges, making sure that the network 
remains connected. At every step, we measure the properties of each net- 
work instance, and monitor its evolution as we introduce more and more 
randomness in their architecture. Eventually, both will resemble a random 
graph, with the average distance, betweenness centrality, efficiency, radius. 



42 




100 



200 300 
Number of Rewirings 



400 



500 



(a) 



500 r 



400 



E300 



re 200 



100 



200 



400 600 
Number of Rewirings 



800 



1000 



(c) 



250 r 



200 



! 150 



100 



50 



200 



400 600 
Number of Rewirings 



800 



1000 



(b) 



200 



1 150 



olOO 



50 



200 



400 600 

Number of Rewirings 



800 



1000 



(d) 



Figure 11: Evolution of the average distance, radius, diameter and resistance of 
networks after successive random rewirings. Each curve is the average of 100 
experiments, rewiring networks of the same size and order N = 500. The curves 
for the average shortest path length correspond to a network of size m = 600. The 
curves for the evolution of radius, diameter and resistance correspond to networks 
of size m = 525. 



diameter and resistance being close to the statistical average. The question 
though is how fast this state is reached. If the change is very fast, this means 
that a few changes in the structure are able to negate the advantages that 
an extremal graph is able to provide. However, if a particular property does 
not change much after several rewirings, we can afford to build networks 
with many other desirable properties without the need to follow exactly the 
specific architecture for every given property. 

We have tested the change of average distance, radius, diameter and 
resistance for networks of small order. Refering to Figure 11, we find that 
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the average distance of networks with large average shortest path length 
decreases very quickly after a few rewirings. Convergence to the statistical 
average is much slower for networks that start with the smallest average 
distance. The number of rewirings required to get close to the statistical 
average is an increasing function of the order of the network when the edge 
density is constant, as one would expect, since there are more edges that need 
to be rewired in order to make a difference. The same general conclusions can 
be drawn for the evolution of the network radius, diameter and resistance. 

The general conclusion is that networks with the smallest average dis- 
tance, radius and diameter and resistance are much more robust to changes 
in their architecture because they rely on a global pattern of interconnections, 
and each edge has a small role in ensuring that property, so its conservation 
is diffused among many edges. On the contrary, networks with maximum 
distance, radius, diameter and resistance are very sensitive to change, be- 
cause most vertex connections are prohibited, in the sense that if currently 
non-neighboring vertices are connected, there is a dramatic change towards 
the statistical average. 
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